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ABSTRACT The breeder's equation is a cornerstone of quantitative genetics, widely used in evolutionary 
modeling. Noting the mean phenotype in parental, selected parents, and the progeny by E(Z 0 ), E{Z W ), and 
E{Z<\), this equation relates response to selection R = E{Z^) - E(Z 0 ) to the selection differential S = E{Z W ) - 
E(Z 0 ) through a simple proportionality relation R = h 2 S, where the heritability coefficient h 2 is a simple 
function of genotype and environment factors variance. The validity of this relation relies strongly on the 
normal (Gaussian) distribution of the parent genotype, which is an unobservable quantity and cannot be 
ascertained. In contrast, we show here that if the fitness (or selection) function is Gaussian with mean /jl, an 
alternative, exact linear equation of the form R' = j 2 S' can be derived, regardless of the parental genotype 
distribution. Here R' = E{Z<\) - \x and S' = E{Z W ) - \x stand for the mean phenotypic lag with respect to the 
mean of the fitness function in the offspring and selected populations. The proportionality coefficient j 2 is 
a simple function of selection function and environment factors variance, but does not contain the genotype 
variance. To demonstrate this, we derive the exact functional relation between the mean phenotype in the 
selected and the offspring population and deduce all cases that lead to a linear relation between them. 
These results generalize naturally to the concept of G matrix and the multivariate Lande's equation 
Az = GP~ 1 S. The linearity coefficient of the alternative equation are not changed by Gaussian selection. 
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The breeder's equation for the evolution of quantitative traits for 
additive genetic effects, introduced by Lush (Lush 1943), is widely 
used both in artificial and natural selection theory and experiments 
(Lande 1976; Falconer and Mackay 1995; Lynch and Walsh 1998; 
Heywood 2005) and appears in all textbooks of quantitative genetic. 
This equation can be stated as follow: consider a continuous pheno- 
typic trait Z subject to selection. Noting the mean phenotype in pa- 
rental, selected parents and the progeny by £(Z 0 ), E(Z W ) and E(Zi) y 
we can define the selection differential S = E(Z W ) — E(Z 0 ) and the 
response R = £(Z X ) — E(Z 0 ). The scalar breeder's equation reads R = 
h 2 S and ascertains that the response to selection and the selection 
differential are related through a proportionality relation that is the 
ratio of genotype to phenotype variances, h 2 . The equation naturally 
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extends to selection on multiple traits and its vectorial version reads 
Az=GP~ l S 

Use of the breeder's equation and its underlying assumptions has 
been criticized by many authors (Kruuk 2004; Heywood 2005; 
Pigliucci 2006; Gienapp et al 2008; Pemberton 2010). One fundamen- 
tal assumption of the breeder's equation is the normal (Gaussian) 
distribution of the breeding value (genotype) and environment factors. 
Authors who demonstrate the linear relation (Kimura and Crow 1978; 
Lande 1979; Lande and Arnold 1983; Nagylaki 1992; Falconer and 
Mackay 1995; Lynch and Walsh 1998; Crow and Kimura 2009) as- 
sume normal distribution for the aforementioned quantities or the 
analogous hypothesis of linearity of the parent— offspring regression 
(see Appendix/Parent— offspring regression). When this assumption 
is relaxed, the breeder's equation is no longer valid, and one has to 
resort to a system of hierarchical moment (or alternatively, cumulant) 
equations to describe the changes; in general, this system is not closed, 
and the moments of a given order depend on moments of higher 
order (Turelli and Barton 1990). 

The assumption of a Gaussian distribution of the genotype can be 
criticized on several grounds (Pigliucci and Schlichting 1997; Pigliucci 
2006; Geyer and Shaw 2008). For example, the very act of selection 
causes the genotype distribution to deviate from a Gaussian (Turelli 
and Barton 1990; Turelli and Barton 1994) (see also equation 6 
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below). Another important case is when the genotype is a cross be- 
tween different breeds due to external gene flow or the breeder's 
scheme. In many cases, the phenotype can have a bell shape and thus 
is assumed to be Gaussian, when the genotype is indeed far from it 
(see, for example, Figure 2A). It is sometimes argued that even if the 
breeding value does not follow a normal distribution, a scale can be 
used to restore it to a normal distribution. Such a scale, however, will 
also distort the distribution of environment factors and the assump- 
tions of breeder's equation are violated even in this case. 

For additive genetic effects and in the absence of epistasis and 
dominance, I derive here a precise functional relation between the 
mean of the trait in the selected subpopulation and in their progeny 
for the general case. The mathematical formulation is close to the 
framework used by many authors such as Slatkin, Lande and Karlin 
(Slatkin 1970; Karlin 1979; Lande 1979). I then use a standard tool of 
functional analysis, the Fourier transform (FT), to deduce all the cases 
that lead to a linear relation between the response R and the selection 
differential S, regardless of the selection function. These cases imply 
a precise form of the distributions of genotype and environment 
factors, and I show that the proportionality factor between R and S 
is the heritability coefficient h 2 only if these distributions are normal. 

The genotype, however, is not observable or controllable, and its 
normal distribution cannot be assumed a priori I show that if instead 
of the genotype, the fitness function and environment factors are 
Gaussian, then a new proportionality relation can be obtained in 
the form of 

R' =j 2 S' (1) 

regardless of the genotype distribution. Noting the mean of the 
Gaussian selection function by /ul, R' = E(Zi) — /m and S' = E 
(Z w ) — fJL are the mean phenotypic lag with respect to the mean 
of the fitness function of the progeny and the selected population 
(Figure 1). As E(Zi) y E{Z W ), and fx are all measurable, R' and S' are 
both measurable in the same way as R and S are. The j 2 coefficient 
contains only the width of the fitness function and environment 
factors. The use of a Gaussian selection function, both in artificial 
and natural selection (as an approximation of stabilizing selection), 
is widespread (Lewontin 1964; Lande 1976; Kimura and Crow 1978; 
Zhang and Hill 2010) and the aforementioned relationship is poten- 
tially as useful as the standard breeder's equation. 

The advantage is more critical when the breeder's or Lande's 
equations are used in long-term evolution, where the variance of 
the genotype (or the G matrix) also varies and h 2 cannot be assumed 
to remain constant (Gavrilets and Hastings 1995; Pigliucci and 
Schlichting 1997; Roff 2000) ; in contrast, the relation (1) remains 
valid if each round of selection uses a Gaussian fitness function. 

The aforementioned results generalize naturally to multivariate 
trait selection where the alternative Lande's equation is 

R' = (H + E)Q~ 1 S' (2) 

where R' and S' are the vectorial phenotype lag, and ft and E are the 
covariance matrices of the fitness function and the environment 
respectively. 

This article is organized as follows: in the Results section, I first 
derive the general functional relationship between R and S; the second 
subsection is devoted to all the cases where these two quantities can be 
linearly related, including the special case of the breeder's equation. 
The alternative breeder's equation is derived in the third subsection, 
and all the results are generalized to selection on multiple traits in the 



fourth subsection. The aforementioned results are put into perspective 
in the Discussion section. Technical details, such as the use of FTs, are 
treated in the Appendix. 

RESULTS 
General results 

Consider a continuous phenotype Z, which is the result of additive 
genetic effect Y and the environment £ (Fisher 1918; Lynch and Walsh 
1998; Visscher et al 2008) 

z = r + £ 

The term environment encompasses here any source of noise that 
causes the observed phenotype z to deviate from the (unobserved) 
breeding value y (Wright 1920; Lynch and Walsh 1998; Raj and van 
Oudenaarden 2008). In the following, the population distribution of 
the breeding value (genotype) and its variance in the parental gen- 
eration are denoted p 0 (y) and a\. The environment effect is cap- 
tured by the distribution law f(z\y), the probability density of 
observing phenotype z with the given genotype y. We will suppose 
that / is a symmetric function of its argument of the form f(z\y) = 
f(z — y) and denote its width by cr|. 

A subpopulation among the parental generation is selected accord- 
ing to a fitness or selection function W(z), the proportion of phenotypes 
in [z, z + dz] to be selected for the production of the next generation. 
The selected individuals produce offspring which will constitute the 
next generation. As we will show herein, the response R (the mean of 
the phenotype trait in the offspring) and the selection differential S (the 
mean of the phenotype trait in the selected parents) are given by 

R = E{Z X ) = ^JJ R2 yPo(y)W(z)f(z-y)dydz (3) 

S = E(Z w ) = l- [[ zp 0 (y)W(z)f(z-y)dydz (4) 

where W is the mean fitness of parental generation. Equations (3) 
and (4) are used, for example, by Lande (1979), although their 
derivation there depended on the normal distribution of the geno- 
type. I derive these equations here for the more general case. 

Before going into the details of calculations, note that the genotype 
distribution po(y) and the selection function W(z) play a symmetric 
role in the aformentioned expressions. In the following sections, we 
will explore specific functional forms of p 0 (y) and W(z), which lead to 
a linear relationship between R and S. Because of the symmetric role 
of these two functions however, once a particular relation is obtained 
for a specific form of po(y) regardless of W(z), an analogous relation- 
ship can be obtained for a similar form of W(z) regardless of po(y). 
This is what leads us to an alternative form of the breeder's equation. 

Let us now derive the equations (3,4). We note that the 
distribution of the phenotype Z in the parental generation is given by 

qo(z) = [ Po(y)f(z\y)dy (5) 

We will denote its variance by <j 2 p . 

The distribution of the phenotype z in the parental population 
selected according to the fitness function W(z) is 
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Figure 1 Schematic representation of the selection lag S', the response 
lag R f , and their relation to the selection differential S and the response 
R. The mean phenotype of parental generation zq, selected population 
z w , the progeny z\, and the peak of selection function \x are repre- 
sented on the phenotype axis z. Dashed curves represent a sketch of 
the distributions of parental phenotype qo(z), selected parents q w {z), the 
progeny qi(z), and the selection function W(z). 



where W is the mean fitness of the parental generation 

W = J R q 0 (z)W(z)dz 

= If^Po{y)W(z)f(z\y)dydz 

The genotype distribution of the selected population is (Turelli and 
Barton 1994) 

Pw(y) = 4r / Po(y)f(z\y)W(z)dz (6) 
W T Jr 

= ^rPo(y)W^{y) (7) 

where 

W*(y) = [ W(z)f(z\y)dz (8) 
Jr 

is the genotype fitness function, i.e., the convolution of the pheno- 
type fitness function by the environment factors. is the mean 
genotype fitness: 

W ] = kPo(y)W^(y)dy 

= Sf-B?Po(yW(z)f(z\y)dydz 

Note that W = as both these quantities are defined by the same 
double integration over the domains of y and z. 

For a large, randomly mating population, reproduction gives for 
the distribution of breeding values in the next generation (Slatkin 
1970; Karlin 1979; Bulmer 1985; Turelli and Barton 1994) 

Pi(y)= // Pw(y a )pw(yb)L(y- (y a +yb)/2)dyady b 

JJr 2 

The exact form of the probability density L(y) that captures the 
inheritance process (recombination, segregation, . . .) is not impor- 
tant here; Turelli and Barton (1994), for example, use a normal 
distribution for L(y) in the framework of the infinitesimal model. 
For our purpose, it is enough to suppose that the mean of the 
distribution L(y) is zero, i.e., J y yL(y)dy = 0 which is valid in the 
absence of dominance and epistasis effects (Turelli and Barton 1990) 
(see also Appendix/Segregation density function) . 
The phenotype distribution of the progeny is 

/ *pi(y)KAy)dy (9) 

JR 



We now make the further assumption that (1) the environment 
and genotype are independent random variables, so that f(z\y) = 
f(z — y) and therefore the variances are additive: a 2 p = ct\ + <x| and 
(2) environment effects are of zero mean (J x xf(x)dx = 0) and sym- 
metric (f(—x) =f(x)). An environmental noise with such a distribu- 
tion law does not change the mean of the random variable: E(Z) = 
E(Y + £) = E(Y). Therefore, the mean phenotype of the offspring is 

R = E{Z 1 )=E{Y 1 ) 

= km<j)dy m 
= ( l l 2 )\h:-{ya +yb)pw{y a )pw{yb)dy a dy b 
= J R ypw(y)dy 

= ^ JJ^ypo(y)w(z)f(z ~ y)dydz (11) 

which is equation (3). Note that the first lines of the above equations 
merely state that the expectations of the breeding's value of parent 
and offspring are equal for purely additive traits. 

On the other hand, the mean phenotype of the selected parents is 

S = E(Z W ) = J R zq w (z)dz 
= wk z ^ z ) w ( z ) dz (12) 

= wffR> z Po(y) w (z)f(z-y)dydz 

which is equation (4). 

For an asexually reproducing organism, or for a sexually repro- 
ducing population which remains at Hardy- Weinberg equilibrium 
after selection-reproduction, we would havep^y) = p w (y) ; this would 
again lead to the same equation (10) and the same response (11). The 
conditions for the existence of multilocus Hardy-Weinberg equilib- 
rium were analyzed by Karlin and Liberman (1979a,b), who con- 
cluded that for additive traits, the equilibrium is stable for a wide 
range of recombination distributions. The general relation between 
R and S can also be studied in the context of the Price equation. 
A detailed study of this relation has been performed by Heywood 
(2005). 

Conditions for proportionality of R and S 

The relations (3) and (4) show that the selection differential S and the 
response R to it are related through a functional equation involving 
three factors: genotype distribution, the selection function and the 
environmental noise. It is far from obvious that R and S could be 
proportional, a question we will investigate by using FTs. 

FTs in functional analysis play a role analogous to logarithms in 
algebra. They are useful for clarifying the R — S relation, where we can 
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transform the double integrations into simple ones. The FT of the 
function u(x) is the function u(k) denned as (see Appendix/ Fourier 
Transforms) 



u(k) : 



/OO 
-00 



-ikx 



u(x)dx. 



For example, the FT of the function u(x) = exp(— a\x\) is 
u(k) = 2a/ (a 2 + k 2 ). Part of the usefulness of FT is due to the fact 
that they transform convolution products into simple products: 
given two functions u(x) and v(x) and their convolution product 

h(z): 



Hz) 



i: 



u(x)v(z — x)dx 



the relation between their FT is a simple product: 

h(k) = u(k)v(k) 



As the general relations (11) and (12) involve convolutions, FT 
proves to be very useful in their handling. Using the various 
properties of FT (see Appendix/Fourier Transforms), the relation 
between R and S in the Fourier space reads: 



dk 1 



(13) 



and 



S = ^k^( k )i[h(k)f(k)]dk 
= R + ^wk^( k )Po( k )M(k)]dk 
where the mean fitness W is itself denned in Fourier space as 



(14) 



W 



±- [ W%k)p 0 (k)f(k)dk. 
27T Jr 



- 1 and we have 



Here a* designate the complex conjugate of a, v 
set the origin of the breeding values at its mean in the parental 
population, i.e., f R ypo(y)dy = 0. 

In general, the FT of a function is complex. However, as the 
function in direct space here are real, it can be shown that expressions 
(13) and (14) are indeed real; the fact that i appears in these expres- 
sion insures this fact (see Appendix/Fourier Transforms). It is worth- 
while to consider a particular case to clarify the above expressions. The 
detailed computations for a truncation selection in which breeding 
value and environmental factors are normally distributed are provided 
in Appendix/Truncation selection. 

We see from equations (13) and (14) that S and R can be pro- 
portional if the second term of the r.h.s. of equation (14) is propor- 
tional to R; this will be true, regardless of the selection function W, if 



df(k) 
dk 



(15) 



where a is an arbitrary constant. Equation (15) is the necessary and 
sufficient condition that defines the functional shape of the genotype 



distribution and the environment noise compatible with the propor- 
tionality of R and S regardless of the selection function. If condition 
(15) is fulfilled, then 

R= (l + a) _1 S 

On the other hand, equation (15) can be seen as a differential 
equation whose solution is given by 



/(*) = bp 0 (k) a 



(16) 



where b is another arbitrary constant. Let us consider some partic- 
ular case where the aforementioned relation is obeyed. 

Normal distributions 

Iff(k) andp 0 (/c) are both Gaussians, i.e., 

f(k) = ex V {-* 2 E k 2 /2) 

p 0 (k)=exp(-cr 2 A k 2 /2) 
then the relation (16) is satisfied by 

a = °eI°a 

and we retrieve the usual breeder's equation R = h 2 S where 
h 2 = (T A /(a A + o- 2 E ). Of course, if f(k) and p 0 (k) are of the above 
form, their inverse FTs represent normal distributions of width a E 
and cr A respectively (see Appendix/Fourier Transforms). 

Stretched exponentials 

We see, however, that even if the strict condition (16) is fulfilled, the 
proportionality constant need not be h 2 . Consider, for example, the 
class of stretched exponential functions 4>(k) = exp( — \k\ a ), which 
generalizes Gaussians (case a = 2). Set f(k) = 4>(aEk), p 0 (k) = 
4>(cr A k). The inverse FT of these functions gives the distribution of 
the genotype Y and environment effect E and it is straightforward to 
show that as for the Gaussian case, Var(£)/Var(7) = cr E /cr A . Con- 
dition (16) however is satisfied this time with a = cr^/a A and there- 
fore the realized heritability h a = R/S is 



h a 



The aforementioned examples were to emphasize the fact that 
selection-independent proportionality is achieved only for particular 
pairs of genotype/ environment distributions. In general, as shown in 
Figure 2, the realized heritability is not constant and depends crit- 
ically on the selection function W(z). 

Alternative breeder's equation 

Optimal phenotypic selection approximated by Gaussians has been 
considered by many authors both in artificial (as early as Lush 1943) 
and in natural selection (as early as Wright 1935; Haldane 1954) and it 
is widespread in the literature (Lewontin 1964; Lande 1976; Kimura 
and Crow 1978; Karlin and Liberman 1979a; Zhang and Hill 2010). If 
the selection function is Gaussian, a new linear relation can be 
extracted from the general relations (3) and (4), regardless of the 
(unobservable) breeding value distribution. 

Note that a symmetric role is played by W(z) and po(y) in the 
general expressions (3) and (4). Hence permuting their role will lead 
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Figure 2 A simple example in which R/S ^ h 2 . (A) The parental breed- 
ing value distribution (thin red line) is a double Gaussian p 0 (y) = C/V(m, s; 
y) + J\f(-m, s; y))/2; the environmental effects distribution (thin blue line) 
follows a normal distribution f{x) = Af{0, a E ; x). The phenotype distribu- 
tion q(z) (equation 5), (thick black line), has the appearance of a normal 
distribution. The result of a truncation selection, selecting only and all 
individuals with phenotype value > z 0 is shown in (B). (B) Right scale: 
The Response R(red line, circle) and the Selection differential S (orange 
line, triangle) as a function of the truncation selection z 0 . Left scale: the 
value of R/S (thick black line) as a function of z 0 and its comparison to h 2 
(thin dashed line). All integrations (equations 11 and 12) can be per- 
formed exactly for this case: R{z 0 ) and S(z 0 ) are combination of Gaussian 
and erf(z 0 ) functions, their exact expressions are given in Appendix/ 
Computation of truncation selection. The parameters of the figures 
are m = 3, s = 2 and cr E = a a = Vm 2 + s 2 = \ZV3, therefore h 2 = 1/2. 



us, following the same line of arguments, to deduce all linear cases 
regardless of genotype. Equations (3) and (4) are obtained by multi- 
plying the function F(y, z) = W(z)p 0 (y)f(z — y) either by y or z and 
integrating over R 2 . To obtain the breeder's equation of the previous 
section, we wrote the integration over the y variable as a convolution 
product and performed the FT on the z variable. 

On the other hand, we could have proceeded by writing equations 
(3) and (4) first as a convolution product on z and then perform a FT 
on the variable y (see Appendix/Fourier Transform). In this case, we 
get 

s = ^J/Kk^ k »^ dk (17) 

and 

R =2^S/ {k) i mkmdk (18) 

The arguments of the previous section can be repeated. Let us center 
the selection function by setting W(z) = W c (z — fi) where 




Then 



s ' = < s - "> = l/ {k)e ~ ik "i WW* ^ 

and 

R ' = <* ~ = l/^i t^ (fc)/(fc) ] * (20) 

The quantities S' and R' are alternative selection differential and 
response and represent the lag with respect to the mean of the 
selection function (Figure 1). In the case in which the selection 
function and the environment factors are both normally distributed 
with width a w and a E , a repetition of the arguments of the previous 
sections leads to 

R' =j 2 S' (21) 

where 

•2 _ °W + °E 

We stress that relation (21) is obtained regardless of the unknown 
genotype distribution p 0 (y). 

The alternative breeder's equation (21) may seem unusual as it 
does not contain the genetic variance. Such a result may seem at first 
glance in contradiction with our basic understanding of the selection 
process. Fisher fundamental's theorem for example explicitly relates 
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the rate of increase in fitness to the genetic variance. There is, how- 
ever, no contradiction: Both R' and S' are dependent on the genetic 
variance, as can be seen in the general equations (3) and (4) ; however, 
their ratio, i.e., the coefficient of the linear equation (21) relating them, 
is free of genetic variance. A similar situation occurs for the classical 
breeder's equation, where both R and S depend on the selection func- 
tion W(z) but their ratio contains only the heritability coefficient, 
independently of W(z). 

Equation (21) has been obtained through the tools of functional 
analysis and its demonstration may seem a little abstract. It is 
worthwhile to further illustrate this equation by considering few 
examples where the computations can be carried out explicitly. Let us 
designate the normally distributed selection function W(z) and the 
environment factors as: 



S = afji (25) 

R = ah 2 ii (26) 
where a = (j 2 — l)/(/ 2 — h 2 ). Therefore, 

R' _ R - fi _ ah 2 - 1 _ . 2 
S 7 ~~ S- ijl ~ a - I ~ J 

and equation (21) is verified. 

Note that in the aforementioned case, R and S are both propor- 
tional to the mean of the selection function fi. Applying a Gaussian 
selection function can therefore be used as a test of the normal dis- 
tribution of the breeding values. 



W(z) = Af(iuL,(T W ;z) 
where N(a, b\ u) = (l/ v / 27r^)exp( - (u-a) 2 /2b 2 ). 



No genetic variance 

The first example we consider is the extreme case in which there is no 
genetic variance (crA = 0) in the parental generation. The distribution 
of the breeding value then becomes a Dirac's delta function p0(y) = 
8(y). The basic rule of Dirac's delta, i.e., Jj8{y)cl){y)dy = 0(0) reduces 
the double integrations of equations (11) and (12) to simple integra- 
tions which involve only Gaussian functions. Note that for the general 
case, reduction of double integration to simple one was achieved by 
the use of FTs. The value of R and S are therefore readily obtained in 
this case: 



W - 



[ W(z)f(z)dz 



exp 



2tt 



R = 0 



zW(z)f(z)dz = n- 



(22) 

(23) 
(24) 



As expected, in the absence of genetic variance, there is no response 
to selection. The response and selection lag R' and S' read: 



R' = R- fjt 



c> c W 

°W + a E 

Therefore, R' = j 2 S\ and equation (21) is verified. This example 
shows explicitly that there is no contradiction between the alterna- 
tive breeder's equation and Fisher's fundamental theorem. 

Gaussian breeding values distribution 

Let us now consider a less-extreme case in which there exists a normal 
genetic variability 

p 0 (y) =Af(0,a A ;y) 

The double integrations (11,12) giving S and R can again be carried 
out exactly, as all the integrands are Gaussian: 



Non-Gaussian breeding values distribution 

Let us now consider a case in which parental breeding values are not 
normally distributed but are concentrated around two particular 
values: 



-cta)) 



(27) 



and therefore E(Y 0 ) = 0 and Var(Y 0 ) — o\. The computation of the 
expressions (11) and (12) can be again carried out exactly: 



\xcr\ + a A cr 2 w tanh 



R = a A 



tanh [ - 



a A fi 



We note that in this case, the ratio R/S ^ h 2 and the classical 
breeder's equation does not hold. The alternative breeder's equation 
however is again verified: 



R' 

S 7 



R- 



v 2 w + i 



'w 



Figure 3 illustrates the accuracy of the alternative relation compared 
to the usual breeder's equation for this case. 

The aforementioned few examples were to illustrate the alternative 
breeder's equation. Many other examples of the breeding value dis- 
tributions, such as a double Gaussian or a rectangular function can be 
computed exactly and lead of course always to the alternative breeder's 
equation. The last example can indeed be generalized and used as an 
alternative demonstration of equation (21), as any function can be 
seen as a superposition of Dirac's deltas: po(y) = J R po(u)8(u — y)du. 
The demonstration we provided using FT is, however, more 
straightforward. 

Selection on multiple traits 

The results of the aforementioned sections are naturally generalized to 
selection on multiple traits. Consider the vectors of parental breeding 
values y 0 = (y^ y 2 , . . ., y^)* environmental effects e = (e lf . . ., e N ) and 
their phenotype z 0 = y 0 + e, to which a selection function W(z) is 
applied. Using the same notations as in the previous sections, we find 
without difficulty that 

zi= / ypo(y)W(z)f(z-Y)dydz 

JR N xR N 
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Figure 3 Gaussian selection function: deviation 8 from theoretical pre- 
diction of the breeder's and alternative equations as a function of the 
mean of the selection function /x. The breeding value distribution cor- 
responds to equation (27). Black line (squares): (R - /x)/(S — fi) — j 2 ; 
red line (circle): R/S - h 2 . Parameters used here are cr A = 2, a w = 1 and 
a E = 3 and therefore h 2 = 4/13 and j 2 = 10. 



z w = zp 0 (y)W(z)f(z-y)dydz 

JR N xR N 

As before, using FT, these relations transform into 
zi=^/ W*(k)(Vp 0 (k))f(k)d M k 

Z7T J R n 

*» = 4zl ^*(k)V(p 0 (k)/(k))d M k 

ITT J R n 

where V is the gradient operator: V/ = (df/dxi, . . . df/dx N ). We see 
again that Z\ and z w are linearly related if 

p 0 (k)(V/(k))=A(Vp 0 (k))/(k) 

where A is a constant matrix. The linear relation is automatically 
satisfied if both p 0 and / follow a Gaussian distribution 

Po(y) oc exp^-^y r G- 1 y^) 

f(x)cc Qxp ^- l - x T E -^ 

where G and E are the covariance matrices for the genotype and 
environmental effects. Denning P = G + E as the phenotype co- 
variance matrix, it is straightforward to show that in this case A = 
EG~ l and therefore (Lande 1979) 

Zi = GP~ l z w 

which is the usual breeder's equation for multiple traits. We stress 
that the limitation of this relation is the same as that of the scalar 
version: it relies on the normal distribution of the genotype. On the 
other hand, if the selection function W(z) is Gaussian 

W(z) oc exp f - l -(z-ix) T £l-\z - 



the arguments of the previous section 2 can be repeated and lead to 
the generalization of the alternative vectorial breeder's equation (21) 

zi - fi = (12 + (z w - fi) 

which, in analogy with equation (21) we write as 

r' = (n + £)n _1 s' 

DISCUSSION AND CONCLUSION 

The breeder's equation is a cornerstone of quantitative genetics and 
appears as a fundamental equation in all the important textbooks of 
this field (Lynch and Walsh 1998; Falconer and Mackay 1995; Crow 
and Kimura 2009). It is widely used in artificial selection (Lush 1943; 
Hill and Kirkpatrick 2010); its usage in natural selection was popu- 
larized by Lande (1976), when he formalized the main idea of phe- 
notypic evolution and it is now commonly used in many articles based 
on Lande's work (see, for example, Hansen et al 2011; Manna et al 
2011; Svardal et al 2011). The mathematical foundation of this equa- 
tion rests upon the hypothesis that the breeding value is normally 
distributed. This hypothesis is plausible for a continuous trait in 
a population not subject to selection (see, however, Appendix/ Segre- 
gation density function). The normal distribution of the breeding value 
is more fragile in populations subjected to selection on this trait 
(Turelli and Barton 1990), as the genotype of selected parents is given 
by (equation 7) 

where W^(y) is the genotype fitness function denned by equation (8). 
Even if po(y) were Gaussian, the very act of multiplying it by an 
arbitrary function makes p w (y) y and hence pi(y) non- Gaussian. 
Therefore after the first round of selection, the normal distribution 
hypothesis of parental genotype cannot be sustained. Turelli and 
Barton (1994) have shown that for the infinitesimal model, the 
non-normality may not have large effects on the predictions of the 
breeder's equation, but they argued that when the number of loci is 
limited the discrepancy can grow much larger. Of course even po(y) 
cannot be assumed to be Gaussian if different breeds are crossed to 
constitute the parental generation, which happens in artificial selection and 
in natural selection when gene flow from nearby patches is important. 

The breeding value is not an observable quantity. The fitness or 
selection function W(z) is more quantifiable and many authors have 
considered a Gaussian selection function. In artificial selection, it dates 
back at least to the work of Lush (Lush 1943), pi 40). In natural 
selection, it is used by most authors as a model for stabilizing selec- 
tion. If Gaussian selection is used to evolve a population, then the 
alternative breeding equation (21) we derived is more precise and 
predictive and rests on more robust mathematical grounds while 
retaining the same simplicity of the standard breeder's equation. Note 
that the analysis of this article is not restricted to the infinitesimal 
model, but applies to all inheritance processes involving purely addi- 
tive genetic effects. The alternative breeder's equation generalizes to 
selection on multiple traits in a way similar to the standard breeder's 
equation and can therefore be incorporated in the "adaptive land- 
scape" formalism (Arnold et al 2001) with the same ease. 

In conclusion, we believe that in all cases where Gaussian selection 
functions are used to evolve a population, the alternative breeder's 
equation we develop above is a useful alternative approach to the 
standard method. 
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APPENDIX 



FTs and convolutions 

Logarithm was invented to simplify algebraic operations: a multiplication in the direct space transforms into an addition in the logarithm space, 
were it can be performed easily and the result brought back to direct space via the inverse transformation. FT plays a similar role in functional 
analysis, where derivation/integration of functions in direct space are transformed into multiplication/division by the variable in the Fourier 
space. 

The FT of a function f(x) is denned here as (Byron and Fuller 1992) 

/+00 
f(x)e- ik *dx 
-00 

where i 2 = —1. For example, the FT of the function f(x) = exp(— a\x\) is 

/(*) = J-oo e( a ~ ik ) x dx + / 0 °° e~( a+ik Ux 

_ 1 _j L_ — 2a 

a — ik a+ik a 2j rk 2 ' 

The main properties of FT we use here are 
1. ParsevaVs theorem 

/+00 , |>+00 

_ f*(x)g(x)dx = — J ^ f (k)g(k)dk 

where a* stands for the conjugate complex of a. Note that iif(x) is a real function, the/* (k) = f( — k), which ensures that the right hand side of 
the above expression is always real if both functions / and g are real. 



2. Derivation property 



3. Convolution property 



where 



i-^-Hk) = / xf(x)e~ ikx dx = TF[*f (*) 
dk J _oo 



FT[(/*^)W] = FT\f(x)].FT\g(x)] =f(k).g(k) 



/oo 
f(u 
-00 



4. Translation property 



(f*g)(x) = / f(u)g(x-u)du 



FT[/(*- M )]=e"^FT[/(* 



On the basis of the aforementioned properties, and the fact that all the aforementioned functions are real i.e., for example W*(z) = W(z), we see 
that relation (3) can be written as 

JJ R iypo(y)W*(z)f(z - y)dydz =f R W*(z)(yp 0 *f)(z)dz 

= ^k^Wi[p(k)]f(k)dk 

where we have used the fact (i) that FT[yp 0 (y)] = (p ' (k) ; (ii) FT transforms a convolution product into a simple product in reciprocal space 
and (iii) Parseval's theorem. 
The same set of rules leads to 

ff R 2Zpo(y)W*(z)f(z-y)dydz =f R W*(z)z(p 0 *f)(z)dz 



bk^^)ikW)m]dk 
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Note that we can exchange the order of integration on y and z, write the first integral as a convolution product on functions of z and proceed 
to the second integral by using the FT on y. For R, we have 

^yp 0 {y)W(z)f(z-y)dydz = f R p* 0 (y)y(W*f)(y)dy 

and for S we get 

^zp 0 {y)W(z)f(z-y)dydz = J R p* 0 (y)(zW *f)(y)dy 

= ^kmi[W(k)]f(k)dk 

The translation property was used in the derivation of the functional lags (eqs 19,20). 
Finally, note that the FT of a Gaussian is a Gaussian: 



FT 



2775 



exp(-x 2 /(25 2 ) 



: exp(-s 2 k 2 /2) 



Computation of truncation selection with FT 

Consider a normal breeding value and environmental factor distribution 

Po(y) =N(0,a A ]y) 
f(x)=N(p,<TE\x) 

where Af(a, b\ u) = (1/ v / 27rfr)exp( — (u— a) 2 /2b 2 ) to which we apply a truncation selection W(z), where W(z) = 1 if z 0 < z < Z\ and 0 other 
wise. The FT of these functions read: 



The mean fitness of the parental generation is 



~p Q (k) = e-^/2 
f(k) = e-°**#l 2 



W(*i,Zo) = § R 2Po(y)W(z)f(z\y)dydz 
= ^kW%k)p 0 (k)f(k)dk 

erf fe)) 



erf 



Zi 

V2A 



where A 2 = a\ + oi and erf(w) is the error function. As 



± \ e sW/2 

dk I 



■s 2 qe-^ 2 / 2 



the computation of R and S, using their expression (13,14) is simple (the integrands are exact differentials) and leads to 



1 cri 



S = 



W^/2ttA 



/2A 2 _ e -z\/2tf 



-zt/2A 2 _ p -z 2 j2A 2 



and it is trivially verified that R/S = a\/ A 2 = h 2 . 

The same computations can be extended to the case where the breeding value distribution is a double Gaussian (Figure 2): 
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PoO) = \ (^( -m, s\y) + N(m, s;y)) 



p 0 (k) = ^(e imk + e~ imk y- s2k2 / 2 

The computations, although more cumbersome, involve the same level of technicality. For simplicity, we give the result only for extreme 
truncation z x — > oo (selecting all individuals with phenotype > z 0 ). Noting 

•w-K—OSr)) 

B(m) = *=e-to ~ + -erfc ( Z -^™) 

Where A = s 2 + cr|. The selection and response function read: 

A(m) + A( -m) 
= W(m) + W(-m) 

g _ B(m)+B(-m) 
W(m) + W(-m) 

The simple Gaussian case can be recovered from these expressions by m = 0 and a A = S. 
Parent-offspring regression 

The derivation of the breeder's equation sometimes uses the parent-offspring regression coefficient as an intermediate(Nagylaki 1992; Lynch and 
Walsh 1998). The linear regression between parent and offspring phenotype however is based on the same assumption of normal distribution of 
genotype and environmental factors. 

The probability density of observing the phenotype z' in the offspring and z a > Z\> in the parents is 

p(z';z a ,z b ) = § R 2p(ya)f(za\y a )p{yb)f(zb\yb)L(yi - (y a +y b )/2)f(z'\y 1 )dyidy a dy b 
and the conditional expectation of z' given z is 

E(z'\z a ,z b ) = / z'p(z',z)dz' / / p(z',z)dz' =F(z a ,z b ) 

It is not difficult to check that the function F(z a , z b ) is a linear function of its argument 

F{z a ,z b ) = b(z a +z b )/2 

if both the genotype and environment factors obey a normal distribution, in which case, the linearity coefficient is indeed b = or 2 A / (a A + <r|). 
However, even if the parental generation follows a normal distribution, the selected parents do not (equation 7) and the use of parent- offspring 
regression poses even more of a problem than the direct method. 

Segregation density function 

Let po(y) be the distribution of breeding value in the parental generation. In the absence of selection, after recombination- segregation, the 
distribution of breeding value in the progeny is 

Piiy) = §R 2 Po(y a )po(yb)L(y - (y a +y b )/2)dy a dy b (28) 

where the function L(y) is the segregation density function capturing the inheritance process of the breeding value (Karlin 1979). L(y) is 
a probability density function and in the absence of epistasis and dominance effect, its average is zero: J R yL(y)dy = 0. In the infinitesimal 
model framework, L(y) is a normal distribution of variance a\/2 . However, any distribution probability L(y) will lead to a stable, although 
not necessarily normal, probability distribution of breeding values after few round of reproduction. Let us set the origin of the breeding value 
at its average in the parental distribution, i.e. f R ypo(y)dy = 0. In Fourier space relation (28) is 
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p l (k)=pl(k/2)L(k) 

and after n rounds of reproduction, 



n- l 

-2 1 



~p n (k)=pl"(k/2")l[L 2 (k/2>) 

i=Q 

As both p 0 (y) and L(y) are probability distribution functions of zero mean, we have 

p 0 (0) = 1(0) = 1 



#(0)=I'(0) = 0 



and therefore 



Let V = J R y 2 L(y)dy . We see then that 



K(o) = ^Po(o)+ ( 2 -^r)i"(o) 



Var(7„) = - Var(Y 0 ) + I 2 - ^ I V 

So the variance of the breeding values converges fast to twice the variance of the segregation density function. The distribution function p n (y), 
however, converges to a normal distribution only if L(y) is normal. 
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